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Let F{x,y) be the joint distribution function of (X, Y), possessing a probability density 
function f{x,y). Let Fi(x) and F 2 (y) be the marginal distribution functions of X and Y re- 
spectively. Let a. be a quantile of F\(x) and /3 be a quantile of F 2 (y). A random sample 
(Xk, Y k ), ft=l, 2, . . . , n, is drawn and the values on each variate are ordered so thai 
X;<X,' and Y'i<CY'j if i<Cj. Let i and j be the greatest integers such that i/n<Fi(a) and 
j/n<F 2 (p), and let M be the number of elements (X, Y) such that X<X; and F< Y\. The 
joint distribution of (M,X' i} Y]) is obtained and is shown to be asymptotically normal. 
Estimates and confidence limits on the parameters of interest are also given. 



1. Summary 

The exact distribution of quantiles as well as its 
limiting form is well known in the univariate case. 1 
Mood 2 investigated the joint distribution of medians 
in samples from a multivariate population and 
showed that it is asymptotically normal. In this 
paper the exact joint distribution of quantiles and an 
auxiliary statistic in samples from a bivariate popu- 
lation are obtained and it is shown that the joint 
distribution is asymptotically tri variate normal. 
The auxiliary statistic is utilized to estimate the 
correlation coefficient between quantiles and also for 
setting up confidence limits on it. Estimates for an 
ordinate of a univariate probability density function 
(pdf) are also obtained. These are used in setting 
confidence limits on the quantiles. 

2. Introduction 

Let F(x,y) be the distribution function of (X,Y) 
possessing a pdf f(x,y) . Let the marginal distribution 
function of X be denoted by F 1 (x) and that of Y by 
F 2 (y). Let a be a quantile of Fiix) and fi be a 
quantile of F 2 (y). It is assumed that the first and 
second partial derivatives of F(x,y) are continuous 
in a neighborhood of (a, 13) and that f(a,P) f^O. A 
random sample (X k ,Y k ), Jc=l, 2, . . . , n, is drawn 
from F(x,y) . Let the sample values of X be ordered 
so that 

Xi < ^X 2 < ^ . . . ^Xn. 

< Y' n . All such samples 



Y] for different indices i 



Similarly, let Y[<Y 2 . . 

for which X\=X\ or Y\ 

and j may be excluded from consideration, as these 

-.•mi pies form a set of probability zero. Let i and j 

be the integers such that 

i/n<F 1 (a)<(i+l)/n;Jln<F 2 (p)<(j+l)/n. 



♦Contribution from the National Bureau of Standards Boulder Laboratories, 
Boulder, Colorado. 

i Harald Cramer, Mathematical methods of statistics, p. 368 (Princeton Univ. 
Press, Princeton, N.J., 1946). 

3 A. M. Mood, On the joint distribution of medians in samples from a multi- 
variate population, Ann. Math. Stat. 12, 268 (1941). 



Hence i/n=Fi(a)—di/n } j/n=F 2 (l3) — 8 2 ln, <8 U 5 2 < 1. 

If M denotes the number of elements (X,Y) in the 
sm in pie such that X<.X f t and F< Y'j, then M is a 
discrete-valued random variable with possible values 
0, 1, . . . , min (i— 1, j— 1). 

First the exact distribution of (M, X' t , Y]) is ob- 
tained for a fixed n. Then the asymptotic distribu- 
tion of the standardized variates is found when 
n, i,j->™ so that iln-^F^a) and jjn-^F 2 {^). 

3. Joint Distribution of the Three Variables 

Let us take a Euclidean plane (x } y) to represent 
the sample points. Given two numbers x Q and y Q , 
the lines x=x and y=yo divide the plane into four 
quadrants, namely 

Qi={(z, y)- x<x , jy<2/o}, 

#2= {(*,?/): •*<% 2/>2/o}, 
Qs= { (x, y) : x >x Q , y< y }, 

Q±={&, yY> z>% 2/>?/o}. 

Let pi=Pr{(j£, Y)eQ f }, t=l, 2, 3, 4. Then 

p l = Ffo, Vo) , V2 = F x Oo) - F(x , y ) , 

Pz=F 2 (y )-F(xo, y ), p^l-F^x,) 

-F 2 (y )+F(x ,y ). 

X't will fall in (x , x Q +dx^) and Yj will fall in (y , 
yo+dy ) in all samples (X k , Y k ), k=l, . . . , n, such 
that the following statements hold simultaneously: 

(1) i—\ points fall on the left of the line x=x , 
n—i points on the right of it, and one point on it; 

(2) j— 1 points fall below the line y—yo, n—j 
points above it, and one point on it. 

Let (x ,Y') and (X',y ) denote the points which 
fall on x=x and y=y respectively. There are five 
distinct possibilities: 
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Case (1) 
Case (2) 
Case (3) 
Case (4) 
Case (5) 



X'<z and Y'<y , 
X'<Zo*a&Y'>y , 

X'^>x and Y f< Cy , 

X'>x, and F'>y , 

X'=x , in which case there is only 

one point (x ,yo), common to both 

the lines. 



We will consider Pr(M=m, x <X'i <x +rfx 0? y 
t <Y , i <y +dy )=P(m,x ,y () )dx () dyo- If M=m, then, 
in case (1), there will be i—m—2, j—m—2 and 
n—i—j J r / m J r2 points in Q 2j Qz, and Q 4 respectively. 
The probability of such a sample is given by 



Pi(m,x ,yo)dx dy 



n\2)Tpt~ m ~ 2 pi~ n 



bFbF 



ml (i— m— 2) ! ( j— m— 2) ! (n+ m—i—j-\-2) ! 

(3.1) 

where the partial derivatives are evaluated at (x Q ,y ) . 
The contribution to P(m,x ,yo) from other cases 
may be found similarly. The possible values of m 
in each case are determined by the rule that negative 
factorials are not allowed. For example, in case (1), 
the maximum possible value of m is min(i— 2, j—2) 
and the minimum possible value is max (0 ,i-\-j—n 
— 2). It will be assumed that the suffixes 1 and 2 
are so chosen that Fi(a)<F 2 (fi) and hence i<j. 
Thus, with the convention that the terms involving 
negative factorials be replaced with zeros, 

5 
k=l 

m=0,l, . . ., i— 1, — *><x , 2/o<°°, (3.2) 
where Pj is as given in (3.1) and 



where all the partial derivatives are evaluated at 
fro, 2A>). 

Hence the joint pdf of (X^Y'j), p(x ,y Q ), is given by 
#(zo,2/o)=S -P(w,«o,2/o), -°°<£o, 2/o<°°, (3.4) 

772 = 

and the probability distribution of M is given by 

J 00 f* CO 

P(m,x ,yo)dx dyo, 
— 00 %} — 00 

m=0,l, . . ., i— 1. (3.5) 

4. Asymptotic Distribution 

In this section the following will be found: 
CO The limit of the pdf of (U,V), p(u,v) (the no- 
tation p is used generically to denote a pdf) where 



U-- 



n"»(Z;-q)/,(«) 

'[/^(aMl-F.te)}]" 2 



n"»(r;-/3)/,(fl) . 
W#{i-^(/3)}] 1/2 ' 



(»i) The asymptotic probability distribution of 
Q=Mjn as n-*oo . 

The following operations are performed. 
(1) Write 

F(a,®=F, F 1 {a)=F l , F 2 (i3)=F 2 , M«)=fu 



,,.. , dF di* 7 d 2 F 



d 2 ^ 



^j.2 #3, jy.,2 04) 



2> 2 F 



bxby 



=/, 9ilfi = C u g 2 lf2=C 2 , f[(a)=f u f 2 (p)=f 2 , 



p 2 = 



nlp-p}— i rf — ip-+-«-«-i (^ (a*)-^) ^ 



P 3 = 



m!(i— m— 2)!(j— m— l)!(n,+m— i— j'-f 1)! 
n!^-"- 1 ^"" ->pl + ' -1+1 ^ (f 2 (y„) -^) 



m!(i— m — 1)! (j—m—2) \(n-\-m — i— j-\-l)l 
m ! ( i — m — 1 ) ! ( j — m — 1 ) ! ( n + m — i — j ) ! 



^2 ^3 1H J__ 

m\(i— m — l)l(j—m — l)\(n-\-m — i— j'+l)! 



p = yi!p^|- w - 1 ^- w - 1 y;+ wt - +1 f(x ,y ) 



(3.3) 
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where all the partial derivatives are evaluated at 
(a, 13). It is observed that by our assumptions none 
of the quantities f u f 2 , gi, g 2 , and / is equal to zero. 
It is further observed that F< F x since F(afi) < F(a, «> ) . 
Also, 

9i=\ f(a,y)dy<\ f(<x,y)dy=f u 

«/ — CO c/ — 00 

and ^2^/2, so that 0<Ci, c 2 <l. 

(2) For S>2, use Stirling's approximation, 



Sl=^2ir Ss+We-^l + OiS- 1 )]. 



(3) Write 



m/n=q, lln=dq. 



This operation is equivalent to replacing the discrete 
random variable M/n by a continuous random varia- 
ble Q over the range (OjFi—l/n). However, 
\Q-M/n\<l/n. 

If by these operations P (m,Xo,y ) dx dy is trans- 
formed into p(g,Xo,yo)dqdx dy , then 

7>(2,^),2/()) = [^(2,»o,W] n fi r (2,»o,2/o)[ 1 + 0(/i" 1 )], (4.1) 
where 

G(a,x ,y ) 

p 9 1 p% 1 ~ q P$~ 9 pl~ Fi ~ F2+Q 

~a Q (F 1 -g) F ^ Q (F 2 -q)^- Q (l-F 1 -F 2 +gy-^- F ^ Q) 

(4.2) 

rj, _ n^ 2 pl(F 1 -qy /2 (F 2 - gf' 2 fdF dF 

U{q,x () ,y ) ^irf'^p^Yq^l-F.-F.+ qy'^x by 

+P3P^\^-F 1 -F 2 +q)(F 2 -q)-^f^)- d ^)^ 

+P2P^\l-F 1 -F 2 +q)(F l -q)-^(f 2 (y,)^^ 
+P2P3P; 2 a-F 1 -F 2 +qy(F 1 -q)-\F 2 -q)-i 

(/iW-s)(*W-^)} (4.3) 

where all the partial derivatives are evaluated at 
(^o,2/o) • The terms in H arise severally from P lf P 2 , 
P 3 , and P 4 after the factor G n is taken out and are 
given in that order. The contribution from P 5 is of 
order n~ l as compared to the other terms and hence 
absorbed in 0(n~ l ) in (4.1). 
(4) Make the transformations 

„ _ n^ixo-ajf, ri<*(y Q -p)f 2 

u - [Fl{ l_ Fl)] m> v -[F 2 (l-F 2 )]^ 



so that 



dx dy 



_ ^/F l F 2 (l-F 1 )(l^F 2 ) 



nfifc 



dudv 



«*' 1 1 ( 1 V (&#o,2A>) dx Q dy {) (/<i goes into p (q,u,v) dqdudv, 
where p is used generically to denote a pdf and is not 
the same function from equation to equation. 

(5) Expand eacli function of (u,v) in Maclaurin's 
series up to the terms of order n~ l , e.g., 

Pi=F+(x - a )g 1 +(y Q -fl)g i +&^ g 3 +(y^l {h 

+ (x -a)(y -p)j+O(n- 3 '*), 

where x and y are replaced in terms of u and t\ 
Then 

log G=w(q) + (x -a) (<L-F)kiU {k^-F(F 2 -F) (1 
-F 2 )} + (2/o-/3) (q-F)k 1 f 2 {k 2 c i -F(F l 

-F)(l-F 1 )}-Ux -an i n{k.A+(l 



-2e 1 )F(F i -F)(l -F 2 ) + (< I -F)A 1 } -i (y 

-pyhf\{k 2 c\+{l-2c 2 )F{F l -F){\-Fd 
+ (<Z-^M 2 } - (xo-a) (y»-P)kJj2{hc,e 2 
-c 1 F(F l -F)(l-F l )-c 2 F(F 2 -F)(l-F 2 ) 

+F(F l -F)(F i -F) + (q-F)A 3 }+0(n-»% 

Here, c x and c 2 are defined in the beginning of this 
section; A X) A 2 , and A$ are some constants depending 
only on the parameters of F(x,y) and need not he 
specified, as it will be seen presently that (q—F) is 
of order n~ 1/2 ; 

l/k l =F(F l -F)(F 2 -F)(l-F l -F 2 +F), 

k 2 =F 1 F 2 (l-F 1 -F 2 +2F)-F 2 ; 
and 



w 



(ff) = 2logf+(^,-2) log5— ?+(F 2 -q) log§-f 



+ (l-^i-^2+2) log 



1—Fi—Fn+F 
1-Fi-Ft+q 



It is easy to verify that w(q) has an absolute maxi- 
mum at q=F in the range of values of q and w(F) 
=w'(F) = 0. Expanding w(q) in a Taylor's series 
about q=Fj 

w(q) = -\kM4-Fy+0[(q-Fn 
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Each function of q is expanded about q=F and 
the transformation 



t=^kik 2 n(q-F) 



(4.3) 



is made with a similar transformation T on the 
random variable Q. The pdf of (T,U,V),p(t,u,v) 
is then given by 

p(t,u,v) = {2ir)-*'\\- p 2 )-" 2 exp [~i ( t -uh x -vh 2 ) 2 

-^E^y+0(n^)] (4.4) 

where 

_ F-F 1 F 2 

P ^F^O—FJd-Fz) 

h _ ^JF 1 [k 2 c 1 ~F(F 2 -F)(l-F 2 )} 
V(1-p 2 )^(1-^ 2 ) 

. _ Mk 2 c 2 -F(F 1 -F)(l-F 1 )] 

h ~ va-^d-^x) ' ( 5) 

The limits of variation of ^ and a are (—00, °°) and 
those of £ are (—d^n, J rd 2 -yjn) where d x and d 2 
are positive constants independent of n. In fact, 

d 1 =-yjkj 2 max (F,F 1 +F 2 -l), d 2 =^kj 2 (F t -F). 

When n-^oo the joint distribution of (T,U,V) 
is a trivariate normal with mean vector (0,0,0) 
and co variance matrix V obtained from 

1 —hi —h 2 

y-i= -hi hl+il-p 2 )' 1 hih 2 - P (l-p 2 )-i 

.-a, ^2-pa-P 2 )- 1 M+a-P 2 )- 1 j 

(4.6) 

Integrating out t, the asymptotic pdf of (U,V) 
is obtained as 



p(^) = (2 7 r)- 1 (l- P 2 )- 1/2 exp 



-\-v 2 — 2puv 



[- 
} - 



2(1-p 2 ) 



{u 2 



™<u, v<&> } (4.7) 



so that p is the correlation coefficient between U and 
V and hence between X\ and F,-. The following 
theorem may then be stated: 



Theorem. J he asymptotic joint distribution of 
the variates (X' t , Fjf ) is normal with parameters 

EX'i=a, EYl=p, 



Cov (*;,r;)= 



F-FiF 2 

nfif 2 



Integrating p(t,u,v) with respect to ^ and v, it is 
found that the distribution of T is asymptotically 
normal with mean zero and variance 



a 2 T =l+h 2 i+hl+2hih 2 p. 

Hence, from (4.3), Q is asymptotically normally 
distributed with mean F and variance 



cr 2 Q =(kik 2 n)- 1 (l+hl+hl+2h 1 h 2 p). 



(4.8) 



The variate M/n, which is discrete-valued, is 
distinguished from Q, which possesses a pdf. How- 
ever, if q=m/n, 

Pr (M/n< q) ^Pr (Q< q). (4.9) 

5. Confidence Limits on Quantiles 



Since X\ is asymptotically normal with mean a 
and variance 

o* t =F l (l-F 1 )/(nfl), 

confidence limits on a with confidence coefficient 
(1— 7) are given by X'i±Z y <ri. Here, 0<7<1, and 



J" 



-x*/2 



V2tt 



dx=- r 



However, a t remains undetermined unless f x is known. 
Exact nonparametric confidence limits on quantiles 
are readily available. 3 However, an alternate pro- 
cedure is outlined here which provides an asymptotic 
estimate of f x and hence of tr t . 

Consider the joint distribution of (X'i- k ,Xi, 
X' i+ i), where as w-^oo, f- >co in such a way that 
ijn-^Fi while k/n and l/n both tend to zero, k and 
I may be taken to be of order n a , 0<a<Cl. The 
probability that t<X'i<t+dt, *i <XU<*i+<&i, 
t 2 <X' i+ i<t 2 -\-dt 2 , is given by p(t,ti,t 2 )dtdtidt 2 , wher e 



p(^«i,^) = 



(i-k-l)\(k-l)Kl-l)\(n-i-l)\ 



(5.1) 



3 S. S. Wilks, Order statistics, Am. Math. Soc. Bull. 54, 6 (1948); G. E. Noether> 
On confidence limits for quantiles, Ann. Math. Stat. 19, 416 (1948). 
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u= 



Make the following transformations 
■yJnjX't—a)^ 



V^(i--Fi) 



t^l=T (X'i — X ( - k ), 



U 2 =j (X' (+l -Xd> (5.2) 



with corresponding transformations from (t,t h t 2 ) to 
(u,Ui,u 2 ). Use Stirling's approximations for factor- 
ials involving i and n. Expand each function of u 
in Maclaurin's series and let n— ><*> . We finally ob- 
tain the asymptotic joint pdf of (U,U ly U 2 ) given by 



„-u 2 /: 



j)( Uj U u U 2 )=-^= 



m* 



// 



*-i pr kf l 



Ul WiY 



u% 



/1W0 



w^w. 1 * d-Di"' 

— oo<u<co y 0<i^, u 2 <co, (5.3) 

The following statements are true asymptotically. 

(1) (U,U 1 ,U 2 ) form an independent set of variates. 

(2) 2kf l U 1 = 2nf l (X , i —Xi- k ) is a X 2 variate with 2k 
degrees of freedom. 

(3) 2lf 1 U 2 =2nf 1 (X' i+l — X'i) is a X 2 variate with 2/ 
degrees of freedom. 

(4) Hence, 

2n(X' i+ i-X' i . k \f 1 

is a X 2 variate with 2{k-\-l) degrees of freedom, 
distributed independently of X' t . 

(5) Thus, 



W= 



U(k+l) 



(X' t -a)(k+l) 



(5.4) 
is distributed as the ratio of a JV(0,1) variate and 



an independent X 2 variate divided by its number of 
degrees of freedom 2(k J \-l). 

W is independent of the parameter /ij however, 
the distribution function of such a ratio has not been 
tabulated, hence, at present, it is of little practical 
use. However, \jf x can be estimated by 



u z ={ku x +iu 2 )i{k+i) y 



(5.5) 



which has mean value equal to l//i and variance 
(fc+J)-'/f*. Thus 



8i = V^l(l-^)^V» 



(5.6) 



is an unbiased estimator of a t with known distribu- 
tion, still in the asymptotic sense. 

In practice, k and I may be taken of order n 1/2 . 

6. Confidence Limits on F and p 

The asymptotic maximum likelihood estimator of 
F is M/n which has asymptotic variance <r 2 Q given 
by (4.8). Solution of the inequalities 



gLves 



— Z y <r Q <Q—F<Zy(TQ 
h 1 (Q)<F<h 2 (Q) ) 



to obtain asymptotic confidence limits on F with 
confidence coefficient (1— 7). For large n, <r Q may 
be replaced by S Q) where Sq is obtained from the 
right-hand side of (4.8) by replacing F by Q. Thus, 
to order n~ 1/2 y 100(1— 7) percent confidence limits 
for Fare Q±Z y S Q . 

Here, again, there is the difficulty of nuisance 
parameters c x and c 2 . To show how to overcome 
this difficulty, the case of medians is presented. 

When F x =F 2 = 1/2, 



P =4F-1, k x l ^F 2 {\l2-F) 2 , k 2 =F(l/2-F), 
2c 2 -l 



°l= F{l J F) +^ [(2c 1 -l) 2 +(2 C2 -l) 2 +2 P (2c 1 -l) (2c 2 -l)]. 



2n 



16n l 



(6.1) 



Now 0<c 1? c 2 <l, 0<F<ll2, and the expression 
in the square brackets has a maximum equal to 

2+2|p|=max {8F, 4(1— 2^)} for variations of c x and 
c 2 . Thus, 



*l< 



F(l-2F) . m*x(2F,l-2F) 



2% 



An 



<l/(4w). (6.2) 



The last bound is the least upper bound of the 
expression in the middle and is attained when F=Q 
or 1/2. 

Hence Q±Z y /(2^n) are asymptotic nonparametric 



confidence limits on F with confidence coefficient at 
least equal to 1— 7. 

Going back to the general case, it is observed that 



r= 



Q-F Y F 2 



^F l F 2 (l-F l K\-F 2 ) 



(6.3) 



is the asymptotic maximum likelihood estimator of p. 
Confidence limits on p are easily set up as p is a 
linear function of F } and r of Q. 
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A special case. If F(x,y) is a bivariate normal with 
correlation parameter p* then, for F 1 =F 2 =l/2, 

c 1 =c 2 =l/2, h 1 =h 2 =0, 



i? 1 1 i * 



L P~Sir, 



p*= sin -p = sin -(4^-1), 



<r Q = 



F{\-2F) 
2n ' 



and 100(1—7) percent asymptotic confidence limits 
for p* are 

sin[|{4e-l±Z T V89(l-2(2)/7i}], 

as sin is a monotonically increasing function of in 

(-tt/2, tt/2). 

7. Generalization 

The results of this paper can be generalized to the 
case of a set of quantiles in samples from a multi- 
variate population possessing a pdf and satisfying 
certain continuity conditions. From the discussion 
of Mood (see footnote 2) of the joint distribution of 
medians in samples from a multivariate population, 
and from considerations of the generalizations of 
eq (3.3), it is seen that the asymptotic distribution 
of quantiles will be multivariate normal. To be 
more explicit, the joint distribution of a set of quan- 
tiles is obtained as a weighted sum of multinomials 
and multinomials are known to tend to normal under 
suitable linear transformations on the variates. Thus 
to specify the asymptotic distribution only asymptotic 
means and co variances of the variates are needed. 
However, these can be determined by considering 
the marginal bivariate distributions. 

Then the following conjecture may be stated: 



Conjecture. Let F(x u . . ., x v ) be the distribu- 
tion function of (X u . . ., X v ) possessing a pdf 
f(xi, . . ., x v ). Let the marginal distribution of X t 
be denoted by F t (x) and that of {X u Xj) by F tj (x,y) 
and corresponding pdfs by f t (x) and f i}j (x,y). Let 



a*i<«*2 



•<«*r<. 



»=1, 2, 



P 



be r t quantiles of F t (x). Continuity of the partial 
derivatives of F up to order p at the points 
(oLij } . . ., ctpj ) is assumed for all possible values 
of (ji, . . ., j p ). Let a sample of size n be drawn 
and the values of each variate be ordered so that 



X'(il)<X'm< . . . <X'(m), 



= 1,2, . . .,#. 



Let k ijy j=l, 2, . . ., r u f=l, 2, . . ., p be 

the integers such that s 

k ij jn<F i {a iJ X.(k ij +l)ln. 

Then the asymptotic joint distribution of the 
variates {X'tyky)} is normal with parameters 



EX'(ik ij ) = a ij , j~- 



■■ 1, z, . . ., r iy 



i=l, 2, . . .,p; 



Cov {X\ik iJ ),X\lk lv )}= Fi > l{aij ' a t m ) fifajWfaJ , 

nh\<Xij)Ji\<xim) 



.7 = 1,2, 



m = l f 2, 



., r t , i — 1, 2, . . ., p, 
., r h 1=1, 2, . . ., p. 



Here Fi,i{aij,a im ) is to be interpreted in the follow- 
ing way: 

F i}i (a ij ,a im )=F i , i (a im} a ij )=F i (a ij ), j <m. 
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of this paper. 

(Paper 64B3-31) 



150 



